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Abstract: We derive the relationship of the redshift and the angular diameter dis- 
tance to the average expansion rate for universes which are statistically homogeneous 
and isotropic and where the distribution evolves slowly, but which have otherwise ar- 
bitrary geometry and matter content. The relevant average expansion rate is selected 
by the observable redshift and the assumed symmetry properties of the spacetime. 
We show why light deflection and shear remain small. We write down the evolu- 
tion equations for the average expansion rate and discuss the validity of the dust 
approximation. 
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1. Introduction 

A factor of two. The early universe from at least big bang nucleosynthesis onwards 
is well described by a model where the geometry is locally spatially homogeneous 
and isotropic up to linear perturbations, the matter consists of a gas of particles 
with positive pressure, and the relation between geometry and matter is given by 
the Einstein equation based on the four- dimensional Einstein-Hilbert action. How- 
ever, at late times such a model underpredicts the distance to far-away sources and 
the expansion rate. Compared to the simplest possibility, the spatially flat matter- 
dominated model, the discrepancy is a factor of about two in both the distance (for 
a fixed Hubble constant) and the expansion rate (for a fixed energy density or age 
of the universe). Therefore at least one of the three assumptions -homogeneity and 
isotropy, standard matter content and standard gravity- is wrong, assuming that 
light propagation is correctly modeled by null geodesies. No deviations from stan- 
dard gravity have been observed in local physics, not in the solar system (apart from 
the Pioneer anomaly and the fiyby anomaly, where the possibility of systematics is 
not ruled out) nor in pulsars [1,2]. Neither is there any detection of an effect of 
exotic matter with negative pressure on local physics. The factor of two discrepancy 
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only appears in observations of distance and expansion rate which involve quanti- 
ties integrated over large scales^. This situation is quite different from that of dark 
matter, for which there is evidence from various different systems on several scales. 

While there is no evidence against standard general relativity or standard mat- 
ter apart from the increased distance and expansion rate, the universe is known to 
be locally far from homogeneity and isotropy due to the formation of non-linear 
structures at late times. It is possible that the breakdown of the homogeneous and 
isotropic approximation could explain the failure of the prediction of homogeneous 
and isotropic models with ordinary matter and gravity [4-9]. The effect of inho- 
mogeneity and/or anisotropy on the evolution of the universe was first discussed in 
detail in [10] under the name "the fitting problem", and the effect on the average 
expansion rate is known as backreaction [11-15]. It has been shown with toy mod- 
els that inhomogeneities can lead to accelerating expansion [14, 16-19], but whether 
this happens for the distribution of structures present in the real universe is not yet 
clear. The order of magnitude of the observed change in the expansion rate and the 
correct timescale of around 10 billion years do emerge from the physics of structure 
formation in a semi-reahstic model [20,21], but there is no fully reahstic calculation 
yet. 

Light propagation and statistical homogeneity and isotropy. Most cosmo- 
logical observations probe quantities related to light propagation, such as the redshift, 
the angular diameter distance (or equivalently the luminosity distance) and image 
distortion. In linearly perturbed homogeneous and isotropic Friedmann-Robertson- 
Walker (FRW) models, the redshift and the distance are to leading order determined 
by the expansion rate and the spatial curvature. The corrections due to the per- 
turbations are small for typical light rays, and are important only for the cosmic 
microwave background (CMB), whose redshift anisotropics are very accurately mea- 
sured, and image distortion, which is zero for the background and remains small when 
perturbations are included. (At least this is the case when the perturbations are sta- 
tistically homogeneous and isotropic and the homogeneity scale is small; see [22] for 
a counterexample with a large spherically symmetric structure.) 

The fact that the optical properties of a FRW universe can be expressed in 
terms of the expansion rate and the spatial curvature is rather obvious, because 
light propagation is, for small wavelengths, purely geometrical, and these are the 
only degrees of freedom in the FRW geometry. In a general spacetime, the situation 
is more involved. Nevertheless, if the distribution of the geometry is statistically 
homogeneous and isotropic, it could be expected that light propagation over distances 
longer than the homogeneity scale can to a first approximation be similarly described 

^It has been argued that locally repulsive gravity has been observed in the motions of galaxies 
near the Local Group [3]. This is an interesting possibility, but the present data is not precise 
enough for such a detection. 
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with a few quantities related to the overall geometry, regardless of complicated local 
details [20]. Light propagation in statistically homogeneous and isotropic universes 
with irrotational dust as the matter content was studied in [23], where it was argued 
that if the distribution evolves slowly compared to the time it takes for light to 
cross the homogeneity scale, then the redshift and the angular diameter distance are 
determined by the expansion rate as a function of redshift and the matter density 
today. The study [23] had three shortcomings. 

First, it was assumed that the variation in the spatial direction of the null 
geodesies (i.e. light deflection) is small. The magnitude of the null shear was also left 
undetermined. Observationally, both light deflection and image distortion are known 
to be small for typical light rays [24], and it should be established that this follows 
from the symmetry properties of the spacetime. Second, the treatment of matter 
as irrotational dust is not locally vahd [25,26], because effects such as rotation and 
velocity dispersion are important for stabilising structures on small scales. The vor- 
ticity and non-dust nature of the matter content may be expected to be unimportant 
for the overall cosmological evolution in the real universe at late times. Neverthe- 
less, such effects should be included to establish under which conditions they can 
be neglected, and put the dust approximation on better footing. Including matter 
other than dust is also necessary for treating backreaction in the early universe such 
as during inflation [12,27-30] or preheating. Third, the arguments were qualitative 
and the corrections to the mean behaviour were not determined. 

We now remedy the first and second problems. We derive results for light prop- 
agation, including deflection and shear, using only assumptions about the symmetry 
of the spacetime geometry and matter content. We consider general matter content 
and include rotation. Concentrating on observable quantities related to light propa- 
gation, we show how the relevant averaging hypersurface is given by the statistical 
symmetry of the spacetime. However, our analysis is not more quantitative than [23], 
and the arguments should be followed up with a more rigorous study. 

In section 2 we go through our assumptions, set up the covariant formalism and 
derive results for the redshift, the deflection, the null shear and the angular diameter 
distance. In section 3 we derive the evolution equations for the scale factor, which 
generahse the Buchert equations of the irrotational dust case [11], and consider the 
validity of the dust approximation. In section 4 we discuss the possible effect of the 
discreteness of the matter content, the relevance of average quantities and the FRW 
description, and summarise the situation. 

2. Light propagation 
2.1 Spacetime geometry 

Statistical symmetry. We assume that there exists a fohation of the spacetime 
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into spatial hypersurfaces of statistical homogeneity and isotropy, which we denote 
by M . The time which is constant on such a hypersurface is denoted t, and when 
referring to a particular hypersurface, we use the notation J\f{t). By this we mean 
that when we consider any region larger than the homogeneity scale, the average 
quantities within the region do not depend on its location, orientation or size. In 
other words, over large scales, there are no preferred locations or directions, and 
no correlations. Locally the dynamics can be complex, as the assumption of sta- 
tistical homogeneity and isotropy only concerns average quantities evaluated over 
large scales. The frame of statistical homogeneity and isotropy may not locally co- 
incide with either the Eckart frame (where there particle number flux is zero) or the 
Landau-Lifshitz frame (where the energy flux is zero) [31]. However, statistical ho- 
mogeneity and isotropy does imply that the integrated flux of any quantity through 
the boundary of a volume larger than the homogeneity scale vanishes. 

In this view, the universe consists of identical (up to statistical fluctuations) 
boxes stacked next to each other. In the real universe, there are correlations even 
over scales longer than the Hubble scale, due to inflation (or some other process in 
the early universe) which produces a large region that is exactly homogeneous and 
isotropic except for linear perturbations. The distribution of the perturbations is 
statistically homogeneous and isotropic. When the perturbations become non-linear 
at late times (in typical supersymmetric dark matter models, the first structures form 
around a redshift of 40-60 [32]), local homogeneity and isotropy are lost, but the dis- 
tribution of non- linear structures remains statistically homogeneous and isotropic, 
and the amplitude of correlations is small beyond the homogeneity scale. What one 
finds as the homogeneity scale depends on the limit that one sets for this amplitude. 
Based on the fractal dimension of the point set of galaxies, it has been argued that 
the distribution becomes homogeneous on a scale of around 100 Mpc to an accuracy 
of about 10% [33]. However, there are still large fiuctuations on 100 Mpc scales, 
and it has been argued that the sample size is not large enough to establish that the 
distribution is self-averaging, which is a necessary condition for statistical homogene- 
ity [34,35]. Studies of morphology also suggest that the homogeneity scale could be 
300 Mpc or more [36]. 

Statistical homogeneity and isotropy is formulated in terms of spatial hypersur- 
faces, but fight travels along null geodesies, not in a spacelike direction. Therefore 
we also need information about the evolution which relates one hypersurface to the 
next. We assume that the evolution is slow in the sense that the timescale of change 
in the spatial distribution is much larger than the homogeneity scale. Phrased dif- 
ferently, the variation of the geometry along a null geodesic is rapid compared to the 
scale over which the mean varies significantly. In the real universe, the timescale for 
change in the distribution of matter and geometry is the Hubble time H~^. Today 
Hq^ — 3000/i~^Mpc (with h somewhat below unity [37]), much larger than 100-300 
Mpc. In the past, the homogeneity scale was even smaller relative to the Hubble 
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scale, as structure formation was less advanced. 

The combination of statistical homogeneity and isotropy on spatial hypersurfaces 
and slow evolution from one hypersurface to the next can be heuristically thought 
of as a distribution that is statistically approximately homogeneous and isotropic 
in four dimensions when considering scales larger than the homogeneity scale, but 
smaller than the timescale of change in the distribution. The notion of statistical 
homogeneity and isotropy in general spacetimes should be made more rigorous, and 
the role of slow evolution in the arguments we make below on light propagation 
should be quantified. 

The two frames. We denote the vector normal to jV by and the velocity of the 
observers by w". Both are normalised to unity, Uarf — Uau" — —1. The observer 
velocity is completely general, it is not assumed to be geodesic or irrotational. For 
reviews of the covariant approach we use, see [38-42] ; for the relation to the ADM 
formalism [43], see [44]. The tensors which project on the hypersurface orthogonal 
to and the rest space orthogonal to are, respectively, 

haP = gap + nan/3 

h^al = 9ap + UaUp , (2.1) 

where Qa/s is the spacetime metric. The restriction of the projection tensor hap to J\f 
is the metric on J\f. The spatial derivative of a scalar is defined as V^/ = hJ^V/sf, 
for vectors we have Vpfa = hph^Vsfj: ^-^d similarly for higher order tensors. The 
spatially projected traceless part of a tensor is /(q/j) = h^^h^-j^f^s — \haph'^^ f-yS- 
The volume element on H is Cap-^ = ijap-ysn^, where rjap-ys is the spacetime volume 
element. The derivative with respect to the proper time s of the frame of statistical 
homogeneity and isotropy is n^Va, and it is denoted by an overdot. Unless = 0, 
the proper time s does not coincide with the time t which is constant on J\f. We can 
write Ua = —t^^dat. The derivative with respect to t is m"Va, with = t~^n". 
We define V = —nam", so F = = dts and = Fn". Physically, F describes the 
time dilation due to the non-geodesic motion of the frame; we have ha — Va In F. 
(Note that a F which depends only on t corresponds to a different time coordinate, 
not different physics.) Only if = can we choose F = 1 and s — t, which is 
equivalent to the statement that A/" is a hypersurface of constant proper time. In 
addition to s and t, we also have the proper time of the observers, defined by u". 

Because the timescale for the evolution of structures is determined by their 
proper time, the hypersurface of statistical homogeneity and isotropy could be ex- 
pected to coincide with the hypersurface of constant proper time of observers co- 
moving with the structures, as argued in [14,20,23]. However, if the matter consists 
of several components which form structures differently, the situation is not so sim- 
ple. For example, in the real universe, dark matter and baryons cluster differently 
(though the differences are not expected to be important on scales larger than the 
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homogeneity scale). And on small scales, dark matter is multistreaming, so there is 
more than one proper time associated with the matter flow at a single point. We 
keep the hypersurface of statistical homogeneity and isotropy arbitrary. 

Without loss of generality, we write the observer velocity w° in terms of n" and 
a component orthogonal to n", 

M° = 7(n° + t;") , (2.2) 

where 7 = —UaU" = (1 — v"^)^^^"^, with = VaV" and Van" = 0. Note that 
is not the peculiar velocity, either in the perturbation theory sense of a velocity 
with respect to a fictitious background, or in the physical sense of deviation from 
a shearfree velocity field [45]. The quantity v" measures the deviation of the local 
observer velocity from the time direction set by the frame of statistical homogeneity 
and isotropy. Even if is zero, there can be arbitrarily large spatial variations in the 
expansion rate. We will see that a large v implies significant anisotropy in the CMB. 
We therefore often take v to be small, and expand to first order in v. We use ~ to 
indicate equality up to and including terms first order in f (We do not assume that 
derivatives of v" are small.) Physically, this means that the motion of the observers 
with respect to the frame of homogeneity and isotropy is non-relativistic. 

Fluid kinematics. The covariant derivative of ra" can be decomposed as 

V/3n„ = ^haf^e + aap - haUp , (2.3) 

where 6 = Vq^" = Vo-n" is the volume expansion rate and aaf3 = ^ {pna) = ^(^/sUa) 
is the shear tensor. The tensor a^is and the acceleration vector ra" are spatial in 
the sense that they are orthogonal to n", aapu'^ = 0, na'n°' = 0. The shear scalar 
is defined as = \aa(3cr°'^ . Because n° is hypersurface-orthogonal, it follows from 
Frobenius' theorem that the vorticity Uap = ^[/sna] +?^[a'^/3] = ^[pna] is zero [38-40], 
[46] (page 434). 

The covariant derivative of the observer velocity u°' can be analogously decom- 
posed with respect to itself, 

V,«„ = Ih^y-^ + + eg - Aau, , (2.4) 
where 9^^^ = V^m", a^^^ = /ig/igV^M^ - le^^^h^:^ = V[/3Wa] + A^^up] and 

Given ( |2.2| ), the expansion rates in the two frames are related as (see [42] for the 
expressions for the acceleration, shear and vorticity) 

-6 + Vav"" + nav"" + Vav"" . (2.5) 
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The energy-momentum tensor. In the geometrical optics approximation, light 
propagation is kinematical, and independent of the laws which determine the evo- 
lution of the geometry. However, we prefer to replace the Einstein tensor with the 
energy-momentum tensor which describes the matter content, and to do that we 
assume that the geometry is related to the matter by the Einstein equation, 

Gal3 = 87rG^Ta/3 , (2.6) 

where Gap is the Einstein tensor, Gn is Newton's constant, and Ta/3 is the energy- 
momentum tensor. 

Without loss of generality, the energy-momentum tensor can be decomposed 
with respect to as 

Ta/3 = P^'^Wnp + P^'^^Kp + 2q'^^np^ + vrj;^ , (2.7) 

where p*^"^ = n'^n^Tap is the energy density, p*^"^ = ^h'^^Tap is the pressure, gi""* = 
—h^rCTp^ is the energy flux and vr^*^^ = h^h^T^s — ^h^ph'^^T^s = Ti^^p) is the 
anisotropic stress. Both gi""* and vr^'^-' are spatial in the sense that q^^n°' = 0, ir^^/jn^ = 
0. The quantities measured by the observers are given by the decomposition with 
respect to m". 

Tap = p^'^^uaup + + 2q^^up) + vrg , (2.8) 

where p^^\p'^^\q^a^ and tt^^^ are defined analogously to the ra" frame quantities. 
Locally, dust is defined as matter for which p'-^^ q^^ and ir'^^p are zero; it then follows 
from the equations of motion that A" is also zero. In the frame, the non-dust 
terms have a clear physical interpretation in terms of what the observers measure. 
Such terms can arise from the properties of matter (it may be that the matter cannot 
be treated as dust in any frame) and from the fact that an ideal fluid looks non-ideal 
to a non-comoving observer. We discuss treating the matter approximately as dust 
in section |3^ . 

We could equally take ( p.7|) and ( |2.8| ) as decompositions of the Einstein tensor 
rather than the energy-momentum tensor. We use assumed symmetry properties of 
( |2.7| ) such as the absence of preferred directions over large distances, and these could 
be equally phrased in terms of the geometry expressed in Gap- However, Tap is more 
transparent because it can be understood in terms of a matter model. 

2.2 Photon energy and redshift 

The photon momentum. We want to relate quantities integrated along null 
geodesies to average quantities which characterise the spatial geometry. We use 
assumptions about the symmetry properties of the spacetime, so averages are most 
meaningfully discussed in terms of quantities on M and the vector ra". In contrast. 
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the observable redshift and light deflection are deflned by the observer velocity -u". 
(The angular diameter distance and the null shear scalar are independent of the 
velocity field [47].) 

In the geometrical optics approximation light travels on null geodesies [48] (page 
570), [49] (page 93). We do not consider caustics, which are not expected to be 
important for typical hght rays in cosmology (though see [50]). For treatment of the 
CMB in the covariant formahsm, see [31,51,52]. The null geodesic tangent vector is 
given by the gradient of the phase of the wave, identified with the photon momentum, 
and denoted by /c". It satisfies kak" = and k^VakP = 0. The redshift plus one is 
proportional to the energy measured by the observer, 1 + ^ oc which in turn is 

£;H = -M^A;" . (2.9) 

The photon momentum can be decomposed into an amplitude and the direction, 
and the direction can be split into components parallel and orthogonal to u'^, 

A;" -£;(")(«" + r") , (2.10) 

with MQ,r" = 0, rQ,r" = 1. 

Because the vector n°' is adapted to the symmetry of the spacetime, it is more 
convenient to calculate quantities in the n°' frame and then transform to the u°' 
frame. The decomposition of A;" with respect to reads 

r = (n" + e") , (2.11) 

with E^"'^ = —nak^^nae"" = 0, e^e" = 1. The quantities E^^^ and e" do not have a 
straightforward observational interpretation, unlike E^'"^ and r°'. 
The observed energy is given in terms of E^'^^ by 

£;H =7(l-^;„e")£;(") , (2.12) 

and the observed direction r" is related to by 

7(1 - vpeP) 

~ (1 + vpe'^)e'^ + vpe^n" - -y" . (2.13) 

The inverse relation is 

= ^(m" + r") - 7-'m" + 

7 + vprP 

~ (1 - Vfsr^y'^ - vpr^u'' + v'^ . (2.14) 
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Statistical homogeneity and isotropy. We obtain the evolution of E^"-^ by op- 
erating with the derivative along the null geodesic, -4- = fc"Va. Denoting drj = 



(n" + e")9a and using ( |2.3| ) and ( |2.11| ), we have 



(2.15) 



which integrates into 



(rvo r 1 

= E(")(to,a;o)exp " dtr 
^E(")(to,a;o)exp (^j\t^{re) 



-9 + n„e" + a„;3e"e^ 



(2.16) 



where the integral is along the null geodesic and the subscript refers to the observer's 
position and time. On the second line we have taken the time t as the integration 
variable; the spatial coordinates x on J\f are understood as functions of t on the null 
geodesic. We have then taken into account that if there are no preferred directions 
in the geometry of Af over long distances, and the direction e" changes only little or 
evolves much more slowly than the distribution of the geometry (we discuss this in 
section |2.3| ), the dominant contribution is given by the average expansion rate. (We 
use the symbol ~ to indicate dropping terms which are suppressed due to statistical 
homogeneity and isotropy, in contrast to ^, which indicates dropping terms which 
are small because v 1.) The argument for this is the following [23]. If Tn" has no 
preferred orientation, it points equally in the directions along and opposite to e", so 
its contribution vanishes. Similarly, Taaf3 contributes only via its trace, which is zero. 
The term then gives the dominant contribution. Under the assumption that the 
timescale for the evolution of the distribution of the geometry is much larger than 
the time it takes for light to cross the homogeneity scale, the integral is dominated 
by the average value of TO, as the contributions of the variation around the average 
cancel. (See section ^ for details of the averaging.) In reality, there is some evolution 
of the quantities along the null geodesic, and the cancellations are not perfect, so 
the contributions of rri^e", Taaise^e^ and of the variation of TO are only suppressed 
instead of zero. 

The observed energy is, using ( p.l2| ). 



^E(")(to,a^o)7(l-^a 



e") exp 



to 



dt-{Te) 



(2.17) 
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and the redshift l + z = E^^^^rj) / E^''\riQ) is 



l + z 



7o(l - Wae°)|o 



7(1 - Va.e'^) 



exp 



(2.18) 



Expressing e" in terms of the observed direction r° with ( |2.14|) , it is transparent 
that there are large observed anisotropics in the redshift of isotropically distributed 
sources unless v is small or constant or there is a conspiracy of cancellations. Con- 
versely, if V is small, the anisotropy is small, even though the variations in the 
geometry can be large. In particular, the near-isotropy of the CMB does not imply 
that the universe would be nearly FRW [23,53]. Assuming f <^ 1, the correction 
due to V reduces to faC^lo — foC" ~ far"|o — far". The first term is the dipole due 
to the motion of the observer with respect to the frame of statistical homogeneity 
and isotropy, and the second, which can have arbitrary angular dependence, is the 
corresponding term at the source. These are in addition to the usual dipole due to 
the difference between the velocity of the observer and the source. As long as the 
difference between and n" is small, the difference in the redshift between the two 
frames is small, even though the expansion rates and 0^"^^ can be very different, as 
the gradient of f " can be large even when v is small. 

The local environment. When we argue for the cancellation of terms other than 
(r^) in the integral ( ^.16| ) due to symmetry, this only applies to propagation over 
distances longer than the homogeneity scale, and deviations due to the local environ- 
ment are not accounted for. For example, in linearly perturbed FRW spacetimes, the 
shear term aapc'^e^ contains the usual local dipole, which we have neglected. To be 
consistent in our approximation of concentrating on propagation over long distances 
and neglecting the effect of the local environments near the source and the observer, 
we should approximate 1 — e°'Va + faC^lo — 1- In the real universe, this approxima- 
tion seems to hold well. The velocity difference between the CMB frame and our 
rest frame is of the order 10~^, and the rest frame of local large-scale structures is 
also near the CMB frame [54]. The effect of the local environment is likely to be 
small as long as structures are small compared to the distance the light travels and 
the observer is not in a special location [20]. This is true for the structures which 
are known to exist and which are expected in usual models of structure formation, 
but may not be valid for speculative large spherical structures, often described with 
the Lemaitre-Tolman-Bondi (LTB) model [55,56]. 

For the CMB anisotropics, the corrections due to the local environment and the 
deviations around the mean cannot be neglected. They are important for the low 
multipoles, as in the Integrated Sachs- Wolfe effect and the Rees-Sciama effect, and 
could be related [57] to observed violations of statistical isotropy of the CMB [58]. 
Formalism for the CMB in the case when the geometry is not perturbatively near 
FRW has been developed in [31]. 
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The mean redshift and the scale factor. The redshift characterises a single 
geodesic (or more accurately, two points and two frames along a single geodesic), so 
its spatial average is not well defined. However, it is useful to introduce the "mean 
redshift" z by 

l + z = exp |^^°dt^(r0)^ . (2.19) 

The physical interpretation of 1 + ^ is that if we take any two points on M(f) and 
M{t^ which are connected by a null geodesic (or several), the redshift along the null 
geodesic(s) is z plus small corrections (assuming that the rest frames of the source 
and the observer are close to the frame of statistical homogeneity and isotropy). 
From the arguments above and the observational fact that the CMB deviations from 
isotropy and from the blackbody shape of the spectrum are small [59] we know that 
temperature differences between different spatial locations are small, and the mean 
value of the redshift gives the dominant contribution. 
We define the scale factor a as (setting a{t^ = 1) 



a{t) = {1 + z)-^ = exp J dt-{Te)j . (2.20) 

The quantity 6 gives the change of rate of the local volume element with respect to 
the proper time s, so TO gives the rate of change with respect to t. Therefore a(t)^ 
is proportional to the volume of A/": the mean redshift is determined by the change 
of the overall volume of space. 

The change of the redshift of a given source with time, called redshift drift 
[60], has been suggested as a test of the FRW metric and LTB models [61,62]. 
Essentially, the change of redshift with time tests the relationship 1 + z = a{t)~^ 
between the redshift and the scale factor, with the scale factor associated with an 
average expansion rate. In the present case, unlike in LTB models, the relationship 
between the mean redshift and the average expansion rate is the same as in FRW 
models. (See section p.l| for discussion of the average expansion rate.) However, 



because redshift drift is a small effect, the variations around the mean would have 
to be considered carefully to make a prediction. 

2.3 Deflection 



Picard's proof. In deriving ( p. 16 ) it was assumed that the spatial direction of 



the null geodesic does not change rapidly along the geodesic. The direction e° is 
the direction of the null geodesic projected on J\f, so it enters into the arguments 
about cancellation due to symmetry. The direction in turn describes the apparent 
position of the source as seen by the observer, so it is an observable, and the change in 
r" is called the deflection. As we do not have information about the 'original' position 
of the source, i.e. the position in the hypothetical situation that the spacetime would 
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be flat along the photon path, the deflection can only be measured statistically, unless 
the apparent position changes on the timescale of the observation, as in microlensing. 

Let us look at the change of e" and along the null geodesic. As with the 
redshift, it is simpler to consider e" and then relate it to r°. It is convenient to choose 
a non-coordinate basis and introduce tetrads adapted to the 1+3 decomposition 
[40,63,64]. We denote the tetrad basis by t"^, with rj^^taAipB = Qapii^A^aB = Vab 
as usual, where rjAs = diag(— 1, 1, 1, 1). We use capital Latin letters to denote 
components in the tetrad basis, e.g. = e°'tj^. 

The change of along the null geodesic is given by 



dA 



+e^(nBe« + a^ce^e^) - a%e" . (2.21) 

where we have on the second line inserted = '^k'^ — inside the co variant 
derivative and used (p^.3|) and (|2.15|). We specialise the choice of basis by taking 



ia — ^a, so that = 6^^,nA = Sao- (In a coordinate basis, this choice is 
not possible in general [38,39].) Then is zero for A = 0, while for the spatial 
components (which we denote by small Latin letters from the middle of the alphabet, 
e* = j^e^) we obtain, using the definition of the covariant derivative, 

^(n)-ide^ = (n^ + e^)VBe^ 
dA 

= d,e' + (n^ + e^)reBe^ 

= dr,e' + a' + n'je^ - aje^e' - e'j,,N{e''e^ , (2.22) 

where the connection components T^^q have been expressed in terms of the de- 
composition ( |2.3|) of Vb^a as well as an object Oj, a symmetric object Nij and an 
antisymmetric object Qij; see [40,64] for details^. Together with the 9 degrees of free- 
dom in {6, 77,*, 0"*^}, the 12 degrees of freedom in {a\ N^j, ^^j} completely characterise 
the spacetime geometry. 

Putting ( |2.21| ) and ( p.22| ) together, we have a system of ordinary differential 



equations for the components e* 

^fiv.e^)- (2.23) 

If the geometry is statistically homogeneous and isotropic and its distribution evolves 
slowly, the change in e* remains small due to the lack of preferred directions in 



^In the notation of [40,64], fiy = eijk^^ 
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{n*, a*, a^j, N'^j, ^^j}- This can be expressed more formally as follows. According to 
Picard's theorem, the system of equations (|2.23| ) has a unique solution given by an 
iteration (see e.g. [65], page 19). Let us define e^^^-^^ 

with e^Q-) = e\rio). The solution to ( p.23|) is given by the iV — >• oo limit. At first step. 



we have 



rv . rv rv 

6(1) = ejo) - / dr]'{h' + a') - e\^^ / dr/'(a^ + fi'^) + ejojej^) / dTi'{hj + a^) 
•J m -^vo J m 

+e(o)4)4o) I' dr/V,, + efo)e[o) d^e^ ,,Ar^, . (2.24) 



"(o)'^(o)'^(o) / "ife "T ^(0)^(0) 

J 7)0 JTjo 



As with ( |2.16| ), we could write drj = Tdt, with the understanding that the spatial 



coordinates are functions of t along the null geodesic. If and a* have no preferred 
direction and evolve slowly, their integral vanishes, provided the distance over which 
the integral is taken is longer than the homogeneity scale. In practice, the cancellation 
is not perfect, because there is evolution and statistical fluctuations, so the integral is 
simply suppressed. Similarly, all diagonal components of any tensor should contribute 
almost equally to the integrals and the contribution from non-diagonal components 
should be suppressed. Because cT*j,f2*j and j^N^ are traceless, their contributions 
are suppressed. One can repeat the argument at every iteration to conclude that the 
solution e' is e*^Q-) plus a small deviation. (For the linear term in ( ^.23| ) this is obvious, 
as it simply gives the ?7-ordered exponential of the integral of cr*^ + fi*j-) The change 
in the observed position of the source is then also small as long as v is small, as 
we see from (p.l3|) . This qualitative understanding should be made more rigorous, 
and the amplitude and the distribution of the deflection should be evaluated. 

2.4 Null shear and angular diameter distance 

The null shear. The distortion of the size and shape of the source image are 
described by the null expansion rate 6 (or equivalently the angular diameter distance 
Da) and the null shear tensor dap- To find these quantities, we need to introduce a 
tensor which projects onto a two-space orthogonal to the null geodesic, hapk^^ = 
0. Analogously to ( ^.3[ ) and ( |2.4| ), the covariant derivative of fc" can be decomposed 
as follows: 

= ^ha/sO + &al3 + k(^aPl3) , (2.25) 

where the trace 6 = h'^pVak^ = ^ak"" is the expansion rate of the area of the null 
geodesic bundle, aa/3 = h^hJV -yks — \haj36 is the null shear and Pa is a vector which 
depends on the choice of hap and plays no role in what follows. We have d'a/sk'^ = 0, 
Pq/c" = 0. The null geodesic vorticity is zero, because fc" is a gradient. The null 
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shear scalar is defined as = ^cjabc^^^ ■ The area expansion rate 6 is related to the 
angular diameter distance by (see e.g. [49,66]) 

oc exp ( i / dA^) . (2.26) 



The angular diameter distance Da and the shear amplitude aa^a"^ do not de- 
pend on hap- They also do not depend on the observer velocity, so it is enough to 
look at the frame without having to transform to the frame at the end. It is 
again convenient to use tetrads instead of sticking to a coordinate basis. We choose 
Has to be orthogonal to both and e^, 

hAB = Qab + UaUb - CaCb 

= QAB - E^-^~\AkB + 2E^^^'\(AnB) , (2.27) 



We proceed with aAB the same way as we did with e"^ in section p^.3| . Taking the 
derivative along the null geodesic and projecting, we obtain (see e.g. [49,66]) 

^(A^^gf = ~^^AB + Cab , (2.28) 

where 

Cab = k^k'^h(A^B)'CMCND 

= 2E^'^^\^hj,f (^EcD + hcDe'''e''EMN - icAiH 



M 
D 



(2.29) 



where Cab has been expressed in terms of the electric and magnetic components 
of the Weyl tensor, Eab = CACBorfn^ = Caobo, Hab = ^(^A'^^CcDBEn^ = 
\^a^^Ccdbo-, and eAB = ^abc^'^ ■ 



On the other hand, from the definition of the covariant derivative we obtain 

I ) 

BC) 



(see [40, 64] for the decomposition of 



d^~aAB + {2^^J + 2efc[,iV/ + e,,uN\e') h^^'a^^' . (2.30) 



Note that in a tetrad basis contracting with Hab commutes with 9^, but not with 

_d_ 



■jr. We denote indices on the space orthogonal to n"^ and with small Latin letters 



from the beginning of the alphabet, e'^ = h'^^e^ , and specialise the choice of basis by 
taking = so that = 5^^,eA = 5^13. Putting together ( p. 2^ ) and ( p. 301) , we 
have 

a,a,5 = -E^'^^'^Oaat + E^^^~'Cab + (2ne(a + 2id[cN\,] + ~ec(aN\)a,f . (2.31) 
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Like equation (|2.23 ) for the components of the deflection, ( p. 31 ) is an ordinary 



differential equation for the two independent components of dab- However, we cannot 
straightforwardly apply Picard's theorem. First, ( |2.31| ) contains the unknown 6. 
Even if we include the equation ( p.32|) given below to obtain a closed first order 
system of equations, Picard's theorem does not apply, because it assumes that the 
variables remain bounded, whereas the initial condition for the area expansion rate 
is 6'(?7o) = — oo at the observer. 

Nevertheless, the reasoning about cancellations due the lack of preferred direc- 
tions still holds, because the solution depends on the source term Cab only via an inte- 
gral. This is transparent with the change of variable aab = ^at + / dr]E^"'^ Cat- We 
now argue as before that due to statistical homogeneity and isotropy Cab contributes 
dominantly via its trace, which is zero, so dab ~ ^ab- This eliminates the source 
term in ( |2.31| ). Given the initial condition T,ab{Vo) = 0; we obtain dab ~ ^ab ~ 0. 
As with the deflection, this argumentation needs to be made more rigorous, and the 
amplitude of the small shear that is generated should be calculated. 



The angular diameter distance. Applying the derivative to 9, we obtain the 
evolution equation for the area expansion rate (see e.g. [49,66]) 

^ = -C^sk^k^ - Ta^ - 
dX 2 

= -STrCf^TABk^k^ - 2^^ - 

= -87rGN(p(") + - 2gJ)e^ + vrfie^e^)^^' - - ^6^ , (2.32) 
where we have used the Einstein equation ( p.6| ) and applied the decomposition 



of the energy-momentum tensor. As discussed in section ^Tl|, we could equally regard 
( |2.7| ) as the decomposition of the Einstein tensor (or, in the present context, the Ricci 
tensor, as the trace does not contribute to (|2.32|) ). 



Using (|2.26| ), we obtain from ( p.32|) the evolution equation for the angular diam- 



eter distance: 

^ = - [47rGN(p(") + - 2q^^^e^ + 7^%e^e^)E^-~^' + a'] . (2.33) 

The solution again depends on the source functions only via an integral. This is 
transparent with the change of variable 6 = Q — SirC^ J dA(p("^ + p^"^ — 2q^^''e^ + 



'^AB 



J^e^e-^)^^")^ We can write 

dA(p(") + - 2gJ)e^ + 7rJ]e^e^)E(")' 
E(")(r7o) / dta-^(r(p(") +p("))) 



^("^(r^o) J dta-i(r(p(") +p(")) + ^TF) , (2.34) 
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where F = f^(p(") + p^")) + 2'^~^q)^'v^ + n^^^v^^ ~ 2q^'v^. On the second hne 
we have taken into account dA = E''-"^ dr] and the approximate scahng E^^'^ 
E^"'\T]o)a~^. We have also again apphed the reasoning that statistical homogeneity 
and isotropy together with slow evolution implies that the contributions of q^^^e"^ and 
TT^^e^^e^ are suppressed, and that the dominant contribution of p^^^-\-p^^^ comes from 
the average, Finally, we have written p^^^ in terms of frame quantities using 

Q, (^, (|^) and (^). Dropping the null shear, the solution to (^) depends 
only on the quantity (|2.34|) and A = J dr]E^^^ ^ E^^^'qo)"^ J dta{T), where we have 
assumed that F has a statistically homogeneous and isotropic distribution and varies 
slowly, so that the integral is dominated by the average value. Because both ( p.34| ) 
and A depend approximately only on t (and E^'^\t]q)), so does 9. Writing ( p.32|) as 
an integral equation, dropping subdominant parts which depend on position, taking 
the time derivative and expressing the equation in terms of the angular diameter 
distance, we obtain 



Hd, [{T)-\l + zfHd.DA] = -47rGN(F(p(«) + p^")) + ^FF)^^ 

^ -47rGN(F(p(") + hqfv^)DA , (2.35) 



where the notation DA{t) refers to the dominant part of the angular diameter dis- 
tance with the corrections to the mean dropped, and we have used the relation 
BiDa = —(1 + z)HdzDA, with H = dta/a. The quantity Da has a similar physical 
interpretation as z: if we take any two points on A/'(t) and A/'(to) connected by a null 
geodesic (or several of them), the angular diameter distance along the geodesic(s) is 
Da plus small corrections. As noted in [23], while Da{z) is well-defined, there does 
not exist a function Da{z) even along a single geodesic, because the redshift is in 
general not monotonic along the null geodesic^. 

Apart from F and F, the equation ( |2.35|) for the mean distance in terms of z is 



the same as in FRW spacetimes. However, the relation between {p^^^ and z 

is different than in the FRW case, as we discuss in the next section (see also [23]). 
As with the redshift and the deflection, it is important to make the arguments about 
the null shear and the angular diameter distance more rigorous by evaluating the 
variation around the mean. Observationally, the angular diameter distance is known 
not to vary much with direction [67]. 



^In [23] it was incorrectly claimed that when the factor in the parenthesis on the right-hand side 
of ( 2.33| ) is positive, Da would be monotonic, because the initial condition (at the observer) for 



is negative. However, because A decreases along the null geodesic away from the observer, this 
only implies that has at most one zero, so Da is separately monotonic on at most two sections 
of the null geodesic, as is well known from FRW spacetimes. In the present case, the sign of the 
factor in the parenthesis is not determined, and the number of zeros of ^^j^ is not limited. 
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3. The average expansion rate and the scale factor 



3.1 The average expansion rate 

Defining the average. We have started with hght propagation, which involves the 
observed quantities directly, and have been led to consider averages. The average of 
a scalar / on is 

= (3.1) 

where t is constant on A/". Recall that in general, t is not a proper time. In particular, 
it is neither the proper time associated with n°' nor the proper time measured by the 
observers. The commutation rule between averaging and taking a derivative with 
respect to t is 

dt{f) = {dj) + {Tef)-{f){Te) (3.2) 

The scale factor a was defined in (|2.2CI| ) to give the mean redshift. From the def- 
inition it follows that is proportional to the volume of N'. The average expansion 
rate of interest is 

3^ = (r^) 

a 

= (r7-i0(") - V„(rt;") - T-f\v^v'' + f VV^f^j)) 

~ (r^H - V„(rf°) - Tv^v'') , (3.3) 

where we have used ( |2.5| ) and the relation ria = r~^VQ,r. In the irrotational ideal 
fluid case, the scale factor was originally defined as the volume of the hypersurface 
orthogonal to m", or equivalently by using the average of 6"-"^ with the lapse function 
included [11,12]. We start from light propagation, and while is the relevant veloc- 
ity field for the redshift, the symmetry of the spacetime selects FVo-n" = TO as the 
relevant local expansion rate. Locally 9 can be very different from 9^^\ even for small 
V, because the derivatives of v" can be large. However, using Gauss' theorem [46] 
(page 433) the total derivative in ( p.3|) can be converted into a surface integral which 
describes the flux of Ff ° through the boundary. If the distribution is statistically 
homogeneous and isotropic, there should be an equal flux through the surface in 
both directions, so the integral vanishes (up to statistical fluctuations). Therefore, 
the difference between the average quantities (F^) and {T9^'^^) is suppressed by v. 

3.2 Evolution equations for the scale factor 

The average equations. We now write down the evolution equations for the scale 
factor a, or equivalently for the average expansion rate. These generalise the Buchert 
equations derived for irrotational dust [11]. In [12] the average equations were written 
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down in the irrotational ideal fluid case using the ADM formalism, assuming that the 
averaging hypersurface is orthogonal to the fluid flow^. In [69] the average equations 
were derived (also in the ADM formalism) for an ideal fluid including rotation, taking 
the expansion rate to be h'^^VpUa = 0^^^ + Uaii", and keeping the hypersurface of 
averaging arbitrary. (The formalism was applied to second order perturbation theory 
in [70], with the hypersurface fixed by the condition H^is = 0. Averaging in second 
order perturbation theory was also considered in [71], with different hypersurfaces 
fixed by coordinate conditions.) In [72], the average equations were derived for an 
ideal fluid in the covariant formalism, with an arbitrary averaging hypersurface. We 
consider general matter content, an arbitrary hypersurface of averaging and use the 
covariant formalism. 

Combining the Einstein equation ( p.6| ) with the Bianchi and Ricci identities 
for n°, the evolution equations can be conveniently written in terms of the decom- 
positions ( p.3| ) and ( |2.7| ) and the electric and magnetic components of the Weyl 
tensor [40-42]. We are only interested in the three scalar equations 

e + -e^ = -47rG'N(p^"^ + 3p^"^) - 2(7^ + h^h" + V„n° (3.4) 
3 

V = 87rGNp(")-i(^)i? + a2 (3.5) 

p(n) + Q^p(n) ^ p(n)) _ - V^?^"^" - 2n«g(")" - a^^vT^")"^ . (3.6) 

where ^^^R is the spatial curvature of M . If A/" is a hypersurface of constant proper 
time, then = 0, and the equations differ from the irrotational dust case only by 
the pressure term in the Raychaudhuri equation ( p.4| ) and the non-dust terms in 
the conservation law ( [3.61 ). In terms of the Hamiltonian constraint ( ^.5] ), the only 
difference is the different evolution of the energy density. We keep n" arbitrary. 
Changing to derivatives with respect to t, averaging, applying the commutation rule 
( p.2|) and using the relations {TO) = 3dta/a and = F^^VoF, we obtain 

3^ = -47rG'N(p(") + 3p(")) + (n„n") + (V„n°) + Q 
a 

+(l(r2 _ 1)^2 ^ _ Y-^)rdte + w) (3.7) 

3^^ = 87rGN(p(")) - le^R) - k + - 1)^^) (3.8) 

2 2 6 

-(V,(rg(")'^)) , (3.9) 



■^These average equations, apart from the conservation law of the energy-momentum tensor, 
were written down in the context of general irrotational matter content in [68]. Note that the 
perturbative calculation in [68] is incorrect, because the averages of both first order terms and 
intrinsic second order terms taken in the perturbed spacetime are neglected in comparison with 
the averages of squares of first order terms. In fact, all these terms are of the same order, and the 
distinction between them is gauge-dependent [9]. 
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where 

Q = ^ {{Ve^) - {Ve)^) - 2{a^) . (3.10) 

In [12,68,69], the equivalent of the factors of T were put inside the averages of the 
terms which appear on the right-hand side of ( p.4|) and ( p.5|) , rather than the left- 
hand side. Inserting ( p.4|) into the last term of ( |3.7] ) and (|3.5| ) into the last term 
of ( p. 81 ) would recover that form of the equations. However, the present convention 
keeps the VoTt." term and the last term of ( |3.9| ) as total derivatives, which we can 
neglect. 

Because the backreaction variable Q is a statistical quantity which characterises 
the distribution of the spatial geometry, it is appropriate to give it in terms of 9 
and 0", which are related to n". However, it seems more appropriate to express the 
energy-momentum tensor in terms of the decomposition with respect to from the 
point of view of estimating the magnitude of the different terms. Using ( |2.1| ), ( p.2| ), 
( P^.TD and ([^.8D, we obtain 

3^ = _47rGN(p(") + 3p(")) + + (V,n") + Q 

a 

+ (l(r2 _ 1)^2 ^ _ Y~^)Tdt9 + 9dtT) - 8nG^{F) (3.11) 

= 87rGN(p("^) - li^'^R) -Iq+ - 1)9') + SttG^F) (3.12) 

2 2 3 

d,{p(-)) + 3^(pW = -(r^(p(") + If)) + (r^^)(p(«) + I-f) 

a 3 3 

-d,{F) - 4^(F) - (V„(rg(«)")) , (3.13) 



a 



with F = f ^(p*^"-* + p(")) + 27 ^gi"^f" + tt'^i^v'^v^ as before. We have not written 
Va(rg'^"^°') in terms of u" frame quantities, because it is suppressed due to statis- 
tical homogeneity and isotropy, like V^n". Vorticity does not appear in the above 
equations, because is hypersurface-orthogonal by construction. Were we to de- 
compose VisUa with respect to the u" frame, the vorticity of would (to leading 
order in v) emerge from Vafi'^ and ^^^R (gj) and (jsj). (For the definition of ^^^R 
for velocity fields which are not hypersurface-orthogonal, see [73].) In particular, 
the leading order contribution of the frame vorticity to the average Raychaudhuri 
equation (|3.7|) vanishes due to statistical homogeneity and isotropy, because it is con- 
tained in the boundary term (Vora"). (In Newtonian gravity, the vorticity combines 
with Q to give a boundary term, so backreaction vanishes for periodic boundary 
conditions and for statistical homogeneity and isotropy [74].) 
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Dropping the boundary terms VQ,(rg*^"'^") and V^ri" as well as all terms higher 
than first order in f , we have 

a 

+ (l(r2 _ 1)^2 ^ _ Y-^)Tdt9 + 0dtT) - 167rGN(gi"^t;°) (3-14) 
3^^ ^ 87rGN(p(^)) - l^^R) - + ^((r^ - 1)^') + 167rGN(gi")t;") (3.15) 

a 3 3 

-29i(g(")t;") - 8^(g(")t;°) . (3.16) 
a 



The dust approximation. One reason for deriving the general equations ( |3.11|) - 



( p.l3|) is to take into account deviations from the approximation of treating the 



matter as dust in the late universe. The importance of the different terms depends 
on the matter model, and cannot be determined from general arguments. However, 
it is possible to say what would would be necessary for the non-dust terms to have 
a significant effect. For the •h'^ terms to be important in ( ^.14D -( p.l6[ ), •fiafi"' would 



have to be of the order of the square of the expansion rate in a large fraction of space 
(contrary to what was argued in [75]; see also [76]), or the contraction of the energy 
flux and h°' would have to be of the order of the product of the average energy density 
and the average expansion rate. In order for the pressure or the anisotropic stress 
to be important, they would have to be on average of the same order of magnitude 
as the average energy density. For the time dilation to be important, the spatially 
varying part of F would have to be of order one in a fraction of space which is of 
order one. If the matter content is a gas of Standard Model particles plus cold or 
warm dark matter, and structures evolve from small adiabatic perturbations with a 
nearly scale-invariant spectrum, it seems unlikely that any of these conditions would 
be satisfied in the late universe when radiation pressure can be neglected. 

Let us assume that the matter is approximately dust in the frame, i.e. that 
p(u) ^ q{u)a ^ ^{u) j^a g^j^g small^, and the deviation of F from unity (and the time 
derivative of the deviation) is small, F = 1 — 5F, with |5F| <^ 1. When we drop 
all squares of small terms (whether they are non-dust terms or v'^), the equations 
(|;i4D-(|16D simplify to 

3^ ~ -47rGN(p^"^ + 3p(")) + Q 
a 

+ {vav'' + ^5r^2 ^ 26Tdt9 + 9dt6T) (3.17) 



^From the equations of motion it follows that A" is zero if p^"\ and Tr^^p are zero, and A" 
is small if p^"-*, n'^} as well as Vo.p'-"-' and V'^tt^'I'' are small. 
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3^ ^ 87rGN(p(")) - le^R) -Iq + liSre') (3.18) 

where we have used the fact that ^ A" — ?)" plus corrections of order v. Equations 
( ^.17D -( p.l9D give the leading corrections to the treatment of matter as irrotational 
dust, compared to the original Buchert equations [11]. In particular, they cover the 
case when the matter can be locally treated as dust, but has rotation, and is 
orthogonal to the hypersurface of constant proper time of observers comoving with 
the dust. Then is of order v, and we can choose F = 1, so the difference between 
and arises only from vorticity. We see that vorticity alone has (to leading 
order in v) no effect on the averages, because the dominant contribution comes from 
the total derivative Von". It does change the relation of the scale factor a to the 
geometry, because a will be defined with a different vector field, but not the relation 
of a to the redshift. 

For irrotational dust, the Raychaudhuri equation ( p.4| ) can be integrated as an 
inequality before averaging to obtain the bound Ht < 1 [46] (page 220), [77,78]. 
When rotation or non-dust terms are important, there is no such local inequality. 
Indeed, having 9 + |6'^ > locally is required in order for collapsing regions to 
stabilise. From a physical point of view, we would still expect to recover Ht < 
1 unless there is sustained acceleration in a significant fraction of space, but the 
conditions for this derived from ( |3.4| ) involve combinations of spatial averages and 
integrals over time, and are not entirely transparent. 

Note that in order for the approximation of treating the matter as dust on 
average to hold, it is only necessary that the contribution of the non-dust terms to 
the averages is smaller than that of the average energy density. It is not required 
that the energy-momentum tensor of matter could be locally approximated as dust 
everywhere. In fact, deviations from the irrotational dust behaviour are necessarily 
important on small scales. As the local Raychaudhuri equation ( |3.4| ) shows, in order 
to stabilise structures, a large or its gradient is needed. This can correspond to 
frame vorticity as with rotating baryonic structures, or the acceleration can be 
generated by anisotropic stress (or a pressure gradient or energy flux) as in the case 
of dark matter [26]. For dark matter, the dust approximation is locally invalid in 
structure formation due to multistreaming [25]. Nevertheless, as long as the volume 
occupied by regions where such terms are important is small, their contribution to the 
average is not important. In Newtonian calculations, this is certainly the case [26]. 

Approximating the matter content as dust on average does not imply viewing the 
matter as infinitesimal grains. For example, the issue of what the "particles" of the 
dust fluid are is sometimes raised, and whether one can consistently "renormalise" 
the scale of the description as larger stable structures form [14,20]. However, the 
dust approximation is properly understood as the statement that when considered 
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on large scales, the energy density dominates over the pressure, the energy flux, the 
anisotropic stress, and their gradients. 

It has been argued that because of gradients of spatial curvature, clocks in dif- 
ferent regions run at different rates, and that this effect is important for cosmology 
but neglected in the dust approximation [79]. Any such effects are accounted for 
in the present analysis, to the extent they are part of general relativity, and not 
outside of the geometrical optics approximation. If the n" frame is non-geodesic, 
different points on the hypersurface of statistical homogeneity and isotropy indeed 
have different values of the frame proper time (though this cannot be understood 
as being due to spatial curvature gradients), which in turn is close to the proper 
time measured by the observers if v is small. For this time dilation to be signiflcant, 
the acceleration ff would have to be of the order of the average expansion rate in 
a signiflcant fraction of space. This in turn requires that either the motion of the 
observers is very non-geodesic (large A"") or the acceleration between the two frames 
is signiflcant (large w"); the latter possibility however has to contend with the fact 
that the velocity between the frames cannot become large, as this would violate the 
small anisotropy observed in the CMB. In order to generate such large accelerations, 
the non-dust terms in the energy-momentum tensor would have to be signiflcant in a 
large fraction of space. This would likely have important cosmological effects apart 
from the time dilation. Note that it follows from statistical homogeneity and isotropy 
that the spatial difference in the CMB temperature between different regions is small, 
in contrast to the arguments made in [79]. This issue can be observationally probed 
with the blackbody shape of the CMB spectrum [59,80]. 

4. Discussion 
4.1 Modelling issues 

Discreteness. While we have kept the energy-momentum tensor generic, the ar- 
guments about light propagation in section § contain the implicit assumption that 
matter is so flnely distributed in space that it can be treated as a continuous distribu- 
tion which light rays sample. The redshift, the angular diameter distance and other 
quantities related to light propagation depend on the spacetime geometry only via an 
integral along the null geodesic. If the matter consists of discrete clumps whose size 
and number density is so small that a typical light ray will never encounter matter, 
the energy-momentum tensor along the light ray is zero, regardless of the average 
energy density (or pressure or other components). For example, the integrand in 
( |2.34| ) vanishes identically, and our arguments about cancellation between regions of 
low and high density do not apply. 

The approximation of discrete matter as a continuous fluid has been studied from 
flrst principles for the dynamics of matter in Newtonian gravity [81]. However, the 
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effect on light propagation has been looked at mostly from the perspective of adding 
perturbations to a FRW metric. With a statistically homogeneous and isotropic 
distribution and small structures, it is then not surprising that the deviation of the 
expansion rate from the FRW case is small [20]. One exception is [82], where the 
effect of discreteness light propagation was considered in a lattice model without any 
FRW approximation. 

If the light travels in vacuum, and we assume statistical homogeneity and isotropy, 
the mean angular diameter distance is given by ( p.35| ) with zero on the right-hand 
side. The equation can be integrated to yield 

Da= fdz , (4.1) 

Jo (l + zyH{z) 

where H and F are the mean expansion rate and the mean time dilation along the 
null geodesic, which in general do not coincide with the spatial averages. Because 
the null geodesic samples only vacuum, the expansion rate along the null geodesic is 
larger than the average expansion rate (assuming that the matter satisfies the strong 
energy condition). The equations ( p.4|) and ( p.5| ) show that if the acceleration and 
the shear aap could be neglected, the expansion rate sampled by the light ray would 
be the same as in an empty FRW universe. If the time dilation could be neglected, 
the angular diameter distance would then correspond to the 'coasting universe' (or 
Minkowski space, for non-expanding regions). However, it is probably not reasonable 
to neglect the shear aafs- For example, the existence of both expanding and non- 
expanding regions means that there is a gradient in the expansion rate, which implies 
non-zero shear. Evaluating the expansion rate along a null geodesic, and the angular 
diameter distance, is thus reduced to the question of realistically modelling the shear 
scalar (and the acceleration n") along the geodesic. 

Discussing light propagation in terms of null geodesies assumes the validity of 
the geometrical optics approximation. Geometrical optics is in turn based on mod- 
elling light as local plane waves, which requires the wavelength of the light to be 
much smaller than both the curvature scale and the scale over which the amplitude, 
wavelength and polarisation of the light change significantly. If the fraction of the 
volume occupied by matter is so small that light rays never come close to the matter 
particles, this is satisfied. However, if the light passes through small discrete re- 
gions where the energy-momentum tensor is non-zero, the situation is very different 
from the geometrical optics limit. (We are here concerned only with gravitational 
interactions, and are not taking into account gauge interactions between photons 
and matter.) In evaluating the validity of the continuous fluid approximation, the 
extension of the wave-packet of the matter particles should therefore be taken into 
account. The treatment of the photon waves should also be more detailed, instead 
of simply treating their transverse width as zero (as in the null geodesic picture) or 
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infinite (as implicit in the plane wave approximation). The effect of discreteness on 
light propagation is not obvious, and should be studied in a realistic model. 



The relevance of averages. The equations (p.7|) - (p.9|) or (|3.11|) - (|3.13|) generalise 
the average equations for the irrotational dust case derived by Buchert [11] to ar- 
bitrary matter content and rotation, and an arbitrary averaging hypersurface. The 
scale factor has been defined to give the mean redshift in the case that the difference 
between the observer frame and the frame of statistical homogeneity and isotropy 
is small, and we have assumed statistical homogeneity and isotropy and slow evolu- 
tion, which are required to have a meaningful notion of mean redshift. The equations 
(|3.7|)-(p^ or ( |3.11| )-( p.l3| ) are of course valid without any symmetry assumptions. 



However, the quantity is of limited use in interpreting observations unless the space 
is statistically homogeneous and isotropic, so that the change in the total volume of 
the spatial hypersurface is the dominant effect for light propagation. 

The system of equations (|3.7])-(P^) or ( |3.11D -( p.l3D is derived from the scalar 



part of the full Einstein equation, which is not closed, because it is coupled to the 
vector and tensor parts. As the sum of two tensors at different spacetime points is not 
a tensor, the average of a tensor (or vector) in curved spacetime is not well-defined, 
so it is sometimes said that the rest of the evolution equations cannot be averaged. 
However, we can write the evolution equations in terms of components, and average 
these. The problem is not lack of covariance^, but the fact that products of variables 
become independent correlation terms, so the number of unknowns increases when 
taking the average, and the set of average equations does not close. 

Methods for covariantly averaging tensors on curved spacetime have been sug- 
gested, including the macroscopic gravity formalism [84, 85] , the Ricci flow [86] , a 
statistical averaging formalism [87], a procedure which relies on a specific choice of 
tetrads [88], and the proposal of [89] which is more a way to rewrite the tensors 
than average them. However, the relevant issue is not the mathematical definition 
of averages in some covariant formalism which is an extension of general relativity 
or in a statistical ensemble of spacetimes. Rather, we want to determine the impact 
of structures in the spacetime which actually describes the universe we observe, with 
the dynamics determined by the Einstein equation and the local equations for light 
propagation. Averages are useful insofar as they provide an approximate description 
of observed quantities in this complex system. The relevant averaging procedure, and 
the hypersurface of averaging, emerges from considering observations, and cannot be 
determined on abstract mathematical grounds. 

It bears emphasising that taking the average on a different hypersurface would 
correspond to considering a different velocity field, and this is a physical choice. This 



^One can even average the vector and tensor part of the equations covariantly by first projecting 
with a vector field such as v"' or Vq0. The Einstein equation can be expressed in full generality in 
scalar form by using a projection [83]. 
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issue is separate from the question of gauge-invariance, i.e. dependence of unphysical 
quantities on the chosen coordinate system. The Buchert equations were originally 
derived using the ADM formalism [11], where the distinction between choice of veloc- 
ity field and choice of coordinates is not entirely transparent. However, the problem 
can be considered completely covariantly, without introducing coordinates [20,23,42]. 
The averages depend on the choice of the averaging hypersurface [29,90], but not 
on the coordinate system [9]^. The relevant velocity field is singled out as that of 
the observer by the redshift, and the relevant averaging hypersurface is given by the 
symmetry properties of the spacetime. 

Deviation from the FRW universe. If backreaction is important for the average 
expansion rate, i.e. if Q contributes significantly to (|3.7] ) and (|3.11| ), there is no "FRW 
background" that would emerge on large scales [14,20]. (Note that Q being small 
does not guarantee that the spacetime can be described by the FRW metric.) The 
FRW metric describes a universe that is exactly homogeneous and isotropic, not a 
universe where there are large non-linearities with a statistically homogeneous and 
isotropic distribution. 

While the deviation of the average expansion rate from the FRW equations 
could be attributed to an effective matter component in a FRW universe, this is not 
the case for other observables. For the shear scalar, this is obvious, because it is 
zero in FRW models, but generally positive. As a less trivial example, the spatial 
curvature in the FRW case is fixed to be proportional to a~^, while the average 
spatial curvature in an inhomogeneous and/or anisotropic space can evolve non- 
trivially [91]. In fact, if the matter can be treated as dust, the effect of backreaction 
is encoded in the non-trivial evolution of the spatial curvature [11,14,78]. (The 
difference between backreaction in general relativity and Newtonian gravity can also 
be understood in terms of the spatial curvature [20].) For this reason, calling the 
effect of backreaction a change of background as in [70,92,93] is misleading. A FRW 
model which reproduces the average expansion rate of a clumpy model is better 
called a "fitting model" or something similar. The metric associated with it cannot 
be used to calculate quantities other than the one specifically fitted for, and usual 
perturbation theory around it does not make sense. 

In particular, if backreaction is important, the relation between the expansion 
rate and the angular diameter distance given by ( f^.35D (assuming that discreteness is 
not important) is different from the FRW case [23]*^. Even though either the change 
of the expansion rate or the change of the distance due to backreaction can be 

''In [71], choice of the averaging hypersurface and the coordinate system was conflated. This 
mixes up defining a quantity of interest using physical criteria and using different coordinates to 
describe the same physics. 

*ln contrast, the relation = (1 + z)^Da for the luminosity distance Dl is universal [39], [49] 
(page 111), [94]. 



- 25 - 



reproduced in a FRW model by introducing extra sources of matter or changing the 
Einstein equation, it is not possible to do both at the same time, since FRW models 
cannot mimic the correlation ( |2.35| ). In a clumpy space, if the dust approximation 
holds, the distance is uniquely determined by the function H{z) and the matter 
density today [23]. (Note that fitting the distance observations may not necessarily 
require accelerating expansion, because the relation between H and Da is different 
from the FRW case.) Analogously, in a FRW universe with general matter content, 
the distance is determined by H{z) and the spatial curvature today [95]. In LTB 
models, the relationship is different from either of these cases [56]. 

This prediction for both FRW models and backreaction can be tested with inde- 
pendent observations of distance and the expansion rate [96]. The deviation of the 
backreaction case from the FRW consistency condition is related to the difference of 
the average expansion rate from the FRW case with vacuum energy and dust [23]. 
There are relatively good constraints on the distance scale as a function of redshift 
from type la supernova observations [97], but measurements of the expansion rate 
using the ages of passively evolving galaxies are less precise [98]. The expansion 
rate at different redshifts also enters into the radial mode of the baryon acoustic 
oscillations, and a measurement was reported in [99] (see also [35,100]). With bet- 
ter observations of the expansion rate, it will be possible to more tightly test the 
statement that the universe is described by a FRW metric, independent of the possi- 
ble existence of exotic matter or modified gravity [95, 101]^. Similarly, backreaction 
can be tested without having a prediction for the average expansion rate. In the 
case of backreaction, it helps that there are independent observational constraints 
on the matter density today, while the only way to determine the spatial curvature 
of a FRW universe is to make independent measurements of the expansion rate and 
distance, and use the FRW relation between them. (In particular, the CMB has no 
model- independent sensitivity to the spatial curvature.) 

4.2 Conclusions 

Summary. In [23] it was argued that light propagation can be approximately de- 
scribed in terms of the overall geometry (meaning the average expansion rate and 
other average scalar quantities) in statistically homogeneous and isotropic dust uni- 
verses where the distribution evolves slowly compared to the time it takes for light to 
cross the homogeneity scale. The calculation was incomplete because it was simply 
assumed that the light deflection is small, and there was no result for the amplitude 
of the image shear. It was also assumed that the matter is dust and irrotational, 
while it is known that such a description does not locally hold everywhere. 

^Note that this is not a test of the Copernican principle. The statement that our position in 
the universe is not untypical is different from the statement that the metric is FRW. In fact, the 
Copernican principle says nothing about the metric. 
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We have now considered general matter content, with a general observer veloc- 
ity and a general hypersurface of statistical homogeneity and isotropy, with a slowly 
evolving distribution. From these assumptions about the spacetime symmetry, we 
find that the propagation of typical light rays over distances longer than the homo- 
geneity scale can to leading order be treated in terms of a few average quantities, as 
in the irrotational dust case. The redshift is given by the average volume expansion 
rate of the hypersurface of statistical homogeneity and isotropy, assuming that the 
velocity difference between the frame of statistical homogeneity and isotropy and the 
observer frame is non-relativistic. The relevant averaging hypersurface is selected 
by the symmetry of the situation as the one of statistical homogeneity and isotropy, 
while the relevant velocity is that of the observers, because it gives the observed 
redshift. The angular diameter distance is to leading order determined by the av- 
erages of the expansion rate, time dilation, energy density and pressure. The light 
defiection and the image shear are small. 

We have also written down the generalisation of the Buchert equations [11] for 
the evolution of the average expansion rate to general matter content and averaging 
hypersurface. Provided that the difference between frame of statistical homogeneity 
and isotropy and the observer frame is small, and that pressure, energy flux and 
anisotropic stress are not signiflcant in a large fraction of space, we recover the 
Buchert equations for dust. 



Outlook. If backreaction has a large effect on the average expansion rate, the 
relation between the expansion rate and the angular diameter distance is different 
from the FRW case, and the difference grows with the deviation of the expansion 
rate from that of the FRW model with dust and vacuum energy. This is a distinct 
prediction of backreaction, which cannot be mimicked by any FRW model [23]. 

Our arguments are qualitative, and should be followed up by a more rigorous 
quantitative study. In particular, evaluating the deviations from the mean is neces- 
sary to study lensing and the low multipoles of the CMB. The equations we have 
written in the covariant formalism contain all general relativistic effects and are ex- 
act, except for the geometrical optics approximation. The study of light propagation 
in a general spacetime is reduced to the system of coupled ordinary differential equa- 
tions ( |2.15| ), ( |2.23| ), ( |2.31| ) and ( |2.32| ). To obtain a solution, we do not have to know 
the global geometry, it is only necessary to specify the distribution along the null 
geodesic. This approach was used in the perturbative case in [102]. This formulation 
is well-suited to slowly evolving statistically homogeneous and isotropic universes, 
where the solution is expected to depend only on the statistical properties of the dis- 
tribution. In particular, because in the real universe the distribution originates from 
small almost Gaussian fluctuations, its statistics are even at late times determined 
by the initial power spectrum, processed by gravity. The effect of the discreteness of 
the matter content of the universe on light propagation should be clarifled. Finally, 
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even though the relation between the expansion rate and the distance is aheady a 
prediction which can be checked, it remains of central importance to derive the aver- 
age expansion rate from the statistics of structure formation [20] in a reliable manner 
to allow easier and more comprehensive comparison with observations. 
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